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Abstract. Zcckcndorf proved that every positive integer has a unique representation as a 
sum of non-consecutive Fibonacci numbers. Once this has been shown, it's natural to ask how 
many summands are needed. Using a continued fraction approach, Lekkerkerker proved that 
the average number of such summands needed for integers in [F n> F n +i) is nj (ip 2 + 1) + 0(1), 
where <p = 1+ 2 * s the golden mean. Surprisingly, no one appears to have investigated the 
distribution of the number of summands; our main result is that this converges to a Gaussian 
as n — > oo. Moreover, such a result holds not just for the Fibonacci numbers but many other 
problems, such as linear recurrence relation with non-negative integer coefficients (which is a 
generalization of base B expansions of numbers) and far-difference representations. 

In general the proofs involve adopting a combinatorial viewpoint and analyzing the resulting 
generating functions through partial fraction expansions and differentiating identities. The 
resulting arguments become quite technical; the purpose of this paper is to concentrate on the 
special and most interesting case of the Fibonacci numbers, where the obstructions vanish and 
the proofs follow from some combinatorics and Stirling's formula; see |MW| for proofs in the 
general case. 
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1. Introduction 



The Fibonacci numbers are one of the most well known and studied sequences in mathe- 
matics, as well as one of the most enjoyable to play with. There are books (such as |Kosj ) and 
journals (such as the Fibonacci Quarterly) dedicated to all their wondrous properties. The 
purpose of this article is to review two nice results, namely Zeckendorf 's and Lekkerkerker's 
Theorems, and discuss some massive generalizations. 

Before stating our results, we first set some notation. We label the Fibonacci numbers by 
Fx = 1, F 2 = 2, F 3 = 3, F 4 = 5 and in general F n = F n _x + F n _ 2 ; we'll discuss shortly why it is 
convenient to use this non-standard counting. We let cp denote the golden mean, 1+ 2 V ^ , which 
satisfies ip 2 — (p + 1. 

Zeckendorf (see for instance [Zej ) proved that every positive integer can be written uniquely 
as a sum of non-adjacent Fibonacci numbers. The proof is a straightforward induction. Note, 
though, how important it is that our series begins with just a single 1; if we had two Is then 
the decompositions of many numbers into non-adjacent summands would not be unique. 

In the 1950s, Lekkerkerker |Lek] answered the following question, which is a natural out- 
growth of Zeckendorf's theorem: On average, how many summands are needed in the Zeck- 
endorf decomposition? Lekkerkerker proved that for integers in [F n , F n+ x) the average number 
of summands, as n — > oo, is -^j-j- + 0(1) ~ .276n. 

Of course, one can ask these questions for more general recurrence relations. Zeckendorf's 
result has been generalized to several recurrence relations (see the 1972 special volume on 
representations in the Fibonacci Quarterly, especially |Hot IKe] . as well as |Len] ) . Burger |Bu] 
proved the analogous result for the mean number of summands for a generalization of Fibonacci 
numbers, G n = + G n -2 + • • • + G n -L- There is, of course, another generalization, which 
interestingly does not seem to have been asked. Namely, how are the number of summands 
distributed about the mean for integers in [F n ,F n+1 ). This is a very natural question to ask. 
Both the question and the answer are reminiscent of the Erdos-Kac Theorem [EKj . which 
states that as n — > oo the number of distinct prime divisors of integers on the order of size n 
tends to a Gaussian with mean log log n and standard deviation \/log log n. 

Our main result is that a similar statement about Gaussian behavior holds, not just for the 
Fibonacci numbers, but for the large class of recurrence relations defined below. 

Definition 1.1. We say a sequence {i/ n }^ =1 of positive integers is a Positive Linear Re- 
currence Sequence (PLRS) if the following properties hold: 

(1) Recurrence relation: There are non-negative integers L,cx, ■ ■ ■ ,cl such that 

H n +i = CxH n + ■ ■ • + c L H n+1 _ L , 



with L, ci and cl positive. 
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(2) Initial conditions: H\ = 1, and for 1 < n < L we have 

H n+X = cxH n + c 2 if„_i H h c n Hi + 1. 

We call a decomposition Y^iLi a iH m +i-i of a positive integer N (and the sequence {a«}™ ly ) 
legal if ai > 0, the other a« > ; and one of the following two conditions holds: 

• We have m < L and = Ci for 1 < i < m. 

• There exists s G {0, . . . , L} such that 

ai = ci, a 2 = c 2 , a s -i = c s _i and a s < c s , (1.1) 

a s+ i, . . . , a s+ £ = for some £ > 0, and {bi}™L{ (with hi = a s+ £ + i) is legal. 

If Y^iLi diHm+i-i is a legal decomposition of N, we define the number of summands (of 
this decomposition of N) to be ai + • • • + a m . 

Informally, a legal decomposition is one where we cannot use the recurrence relation to 
replace a linear combination of summands with another summand, and the coefficient of each 
summand is appropriately bounded. For example, if H n+1 = 2H n + 3H n _i + H n _ 2 , then 
H 5 + 2H 4 + 3H 3 + Hi is legal, while H 5 + 2H 4 + 3H 3 + H 2 is not (we can replace 2H 4 + 3H 3 + H 2 
with if 5), nor is 7H^ + 2H 2 (as the coefficient of H§ is too large). Note the Fibonacci numbers 
are just the special case of L = 2 and c\ = c 2 = 1. 

We adopt a probabilistic language to state our main results. 

Definition 1.2 (Associated Probability Space to a Positive Linear Recurrence Sequence). Let 
{H n } be a Positive Linear Recurrence Sequence. For each n, consider the discrete outcome 
space 

fin = {Hn-, H n + 1, H n + 2, H n+ i — 1} (1.2) 

with probability measure 

¥ n {A) = TT ~ tt ? a C ^; (1.3) 

in other words, each of the H n+ \ — H n numbers is weighted equally. We define the random 
variable K n by setting K n (u) equal to the number of summands of uo G fi n in its legal decompo- 
sition. Implicit in this definition is that each integer has a unique legal decomposition; we will 
prove this fact, and thus K n is well-defined. It is also convenient to study K. n = K n — 1; every 
integer in fi n must have at least on H n as a summand; thus K. n is the number of non-forced 
summands in a legal decomposition. 

Our main result is 

Theorem 1.3. If {if n } ra < L 1 is a Positive Linear Recurrence Sequence then every positive integer 
may be written uniquely as a legal sum 'Y^ i aiH i (see Definition Let K n be the random 

variable of Definition \1.2[ Then K[K n ] and Vax(K n ) are of order n, and as n — > 00, K n (and 
K, n ) converges to a Gaussian. 
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Our result first extends Zeckendorf 's and Lekkerkerker's theorems to a large class of re- 
currence relations, and then goes further and yields the distribution of summands tends to a 
Gaussian. The proof has three main ingredients. The first is to adopt a combinatorial point of 
view. Previous approaches to Lekkerkerker's theorem were number theoretic, involving con- 
tinued fractions. We instead view this as a combinatorial problem, namely how many ways 
can we choose elements in a set subject to some restrictions on what may be taken. This 
approach gives us an explicit formula for the number of N in [H n , H n+ i) that have a given 
number of summands. For each interval [H n , H n+ i) (i.e., for each n) we may thus associate a 
probability function p n on [H n ,H n+ i) where p n (k) is the probability of having exactly k + 1 
summandsj^ note p n is the density of the random variable K. n . We then use generating func- 
tions and differentiating identities to obtain tractable formulas for these summand functions 
p n , and prove our claims about the mean and the variance. We conclude by showing that as 
n — > oo the centered and normalized moments of p n tend to the moments of the standard 
normal; by Markov's Method of Moments this yields the Gaussian behavior. 

We can gain a lot of intuition as to why these results are true by looking at the special 
case of L = 1 and c\ = B > 0. For these choices, our sequence is just H n = B n ~ l ; in other 
words, we are looking at the base B decomposition of integers. Zeckendorf 's theorem is now 
clearly true, as every number has a unique representation. Lekkerkerker and the Gaussian 
behavior are now just consequences of the Central Limit Theorem. For example, consider a 
decomposition of an N G [B n , B n+1 ): 

N = a t B n + a 2 B n - 1 + ■ • • + a n+1 B°. 

We have ai G {1, . . . , B — 1} and all other G {0, . . . , B — 1}. We are interested in the 
behavior, for large n, of a\ + • ■ • + a n+1 as we vary over N in [B n , B n+l ). Note for large n 
the contribution of a± is immaterial, and the remaining aj's can be understood by considering 
the sum of n independent, identically distributed uniform random variables on {0, . . . , B — 1} 
(which have mean and standard deviation a/ (B 2 — 1)/ 12). Denoting these by A^, by the 
Central Limit Theorem A% + • • • + A n+ \ converges to being normally distributed with mean 
^Y^n and standard deviation n^/ (B 2 — 1) /12. 

Our approach is quite general, and can handle a variety of related problems. We state just 
one more, which allows us to see some very interesting behavior. 

Recently Alpert [XI] showed that every positive integer can be written uniquely as a sum 
and difference of the Fibonacci numbers {F n }™ =l such that every two terms of the same sign 
differ in index by at least 4, and every two terms of opposite sign differ in index by at least 3; 
we call this the far-difference representation. For example, 

2011 = Fn — F14 + Fs + Fs and 1900 = Fn — F\± — F\q + Fq + F 2 . 



As remarked earlier, we choose to write the density this way as every integer in [H n , H n+ i) must have H n 
in its decomposition, and thus it is more natural to study the number of additional, non-forced summands 
needed. 
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If S n = J2o<n-u<n Fn-Ai — F n + -F„_ 4 + • • • for positive n and otherwise, then for each 
iV G (S n -i, S n ] the first term in its far- difference representation is F n . Note that has the 
empty representation. We can show 

Theorem 1.4. Consider the outcome space Q n = {S n -i + l, S n -\+2, . . . , S n } with probability 
measure P(A) = YuweA s -s — f or ^ c Let K„ and L n be the random variables denoting 
the number of positive and negative Fibonacci summands in the far- difference representation 
(they are well-defined by [Al\). As n — >• oo ; for any real numbers a and b the random variable 
aK n + bL n converges to a Gaussian. The expected value of K n (which is <f)/2 more than that 
of L n ) is ^ + 371 ~ 4 1 13v/ ^ + o(l); the variance of both is of size 15 | Q 2 1 v/ ^ n. K n and L n are 
negatively correlated, with a correlation coefficient of — (21 — 2</?)/(29 + 2(f). Further, K n + L n 
and K n — L n are independent random variables as n — > oo, which implies the total number of 
Fibonacci numbers is independent of the excess of positive to negative summands. 

Unfortunately, our arguments become involved and technical to handle all of the cases 
in Theorems 11.31 and 11.41 In order to highlight the ideas without getting bogged down in 
computations, in this note we concentrate on the most important special case of Theorem 11.31 
namely the Fibonacci numbers, and provided a sketch of some of the arguments in Theorem 
11.41 We first describe our combinatorial perspective, which yields 

Pn(k) = (1-4) 

(remember p n (k) is the probability that an JV 6 [F n ,F n+ i) has exactly k + 1 summands in 
its Zeckendorf decomposition). All of our theorems follow from knowing this density. The 
difficulties in the general case are due to the fact that the corresponding formulas for the 
densities are far more involved; here we can easily determine the behavior by applying Stirling's 
formula. 

We first use our explicit formula for p n (k) to prove Zeckendorf 's theorem, and then sketch 
how we can use it to prove Lekkerkerker's theorem (the mean as well as compute the 
variance a 2 . We then show that as n — > oo, p n {k) converges to a Gaussian with mean fi n and 
variance a\. While technically we could just immediately jump to the limiting behavior of 
the density, it would be very unmotivated not knowing the mean and the variance and either 
choosing the correct values by divine inspiration, or having them fall out of the resulting 
algebra. 

2. Combinatorial Perspective and Zeckendorf's Theorem 

The key input in our analysis is counting the number of solutions to a well known Diophan- 
tine equationo 

Lemma 2.1. 



This problem is also known as the stars and bars problem, the cookie problem, or the simplest case of 
Waring's problem. 
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(1) The number of ways of dividing n identical objects among p distinct people is ("^ ) . 
Equivalently, this is the number of solutions to y\ + • • • + y p — n with each y^ a non- 
negative integer. 

(2) More generally, the number of solutions to y\ + ■ • • + y p = n with y t > q ( each Ci a 
non-negative integer) is ^ n -( c i+ ,,- +cp)+P- 1 ^ _ 

Proof. For (1): The two formulations are clearly equivalent; simply interpret yi as the number 
of objects person i receives. To prove the claimed formula, imagine the m objects are in a row 
and we add p — 1 items at the end. We now have n + p — 1 objects. There is a one-to-one 
correspondence between assigning the n objects to the p people and choosing p — I of n + p— 1 
objects. There are ("p^ 1 ) wa Y s to choose p — 1 of the n + p — 1 items. All the items up to 
the first one chosen go to person 1, then the items up to the second one chosen go to person 
2, and so on. 

For (2): We may write yi = Xi + Ci with each x% > a non- negative integer. Our problem 
is equivalent to 

{x\ + Ci) H V (x p + c p ) = n, 

which becomes 

X\ H h Xp = n - (ci H hCp), Xi > 0, 

whose solution is given by part (1). □ 

There are two parts to Zeckendorf's Theorem: not only does a decomposition exist of any 
positive integer as a sum of non- consecutive Fibonacci numbers, but such a decomposition is 
unique. As our combinatorial approach does require this uniqueness as an input, we provide 
the standard proof below. 

Lemma 2.2 (Zeckendorf's Theorem - Uniqueness of Decomposition). If two sums of non- 
consecutive Fibonacci numbers are equal, then the two sums have the same summands. 

Proof. Assume 

F'm ~\~ ' ' ' ~\~ F nk F mi -\- F m2 -\- • • • -\- F m ^ (2.1) 

where n\ > n 2 > ■ ■ ■ and mi > m 2 > • • • . Without loss of generality we may assume 
F ni > F mi (as otherwise we would just remove some summands). As each decomposition is of 
non-adjacent summands, if we add 1 to the decomposition on the right of (12. ip . the largest it 
can bej is F mi + F m2+ i, which itself is at most F mi+ i. As F ni > F mi , we see that adding 1 to 
the right hand side of (12. ip yields a number at most F ni ; thus 

F'm ~\~ ~ ~ ~ ~\~ F'nk F mi -\- F m2 -\- • • • -\- F me , (2.2) 

contradiction. □ 

We are now in a position to prove the first part of Zeckendorf's Theorem. In the course of 
our proof we will derive (11 .4p . the claimed formula for p n (k). 



3 For example, F 6 + F 4 + F 2 + 1 = F 6 + F 4 + F 3 = F 6 + F 5 = F 7 . 
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Theorem 2.3 (Zeckendorf's Theorem - Existence of Decomposition). Any natural number 
can be expressed as a sum of n on- consecutive Fibonacci numbers. Further, the probability an 
N G [F n ,F n+ i) has exactly k + 1 summands in its Zeckendorf decomposition is ( n ~^~ k ) / F n _i 
(in other words, the density function of the random variable K. n is p n {k) = ( n ~^~ )/F n _i. 

Proof. Consider all N in [F n , F n+ i). The number of integers in this interval is F n+1 — F n = F n _i. 
We claim that each of these F n _i integers can be expressed as a sum of a certain subset of 
{Fi, F 2 , . . . , F n } with the properties that no two consecutive Fibonacci numbers appear in the 
sum and that F n is one of the summands. From the arguments in the proof of Lemma 12. 2[ it 
is clear that F n must appear in this sum since otherwise the sum would be too small to be an 
element of [F n ,F n+ x). 

We now translate the previous claim into the combinatorial formulation of Lemma 12.11 
Suppose we wish to have k summands in addition to the summand F n in our sum, with 
no two summands adjacent. Clearly k < L 22 ^]- Choosing a valid set of k summands, say 
F mi , . . . , F mk , is equivalent to choosing k indices mi, m 2 , . . . , rrik from the set {1,2, ... ,n — 1}, 
with the property that m.j < mj+i — 1 and n — 1 is not chosen (as otherwise we would have 
the adjacent summands F n _i and F n ). 

We may assume k > 0, as there is only one way to choose no additional summands. We define 
the auxiliary sequence yj as follows: y = m x — 1, and for 1 < j < k, yj = rrij+i — rrij — 1 (as 
noted earlier, m^+i = n). For example, with n = 9, k = 3, and the sequence Fx + F 3 + F 6 + Fg, 
we have yo = 0, y\ — 1, y 2 — 2 and y 3 = 2. Note yo is the number of indices before the first 
index chosen, and for j > 0, yj equals the number of unused indices between rrij and 
Clearly we have yo > and yj > 1 for j > 0. Thus we have used k + 1 indices (including 
the required index n associated with the largest summand, F n ) and y + y% + • — h unused 
indices from {1,2, ... ,n}. Hence we have 

(k + l)+y + yi + --- + y k = n. (2.3) 

If we make the change of of variables xq = yo and Xj —yj — 1 for j > 0, then we have Xj > 
for all j with no other constraints on these variables other than they must satisfy the identity 
implicitly given by (j2.3P ; that is 

x H Vx k = n-1 -2k. (2.4) 

In other words, in view of Lemma 12.21 we have a bijection between the set of Zeckendorf 
decompositions with k + 1 summands having F n as its largest summand and the set of all 
non-negative integer solutions to ( 12. 4p . By Lemma |2.1[ the number of solutions to (12. 4p is 
^n-i-2fc+(A+i-i)^ _ ^n-i-ky Thus the number of Zeckendorf decompositions having largest 
summand F n is precisely 

n-1 I 



'n — 1 — k 



E 



(2.5) 



k 

fc=0 v 

which, by a well-known identity for binomial sums, equals F n _i (see Lemma [A. II of Appendix 
|X]for a proof). As remarked, by Lemma I2T21 each one of these sequences gives rise to a distinct 
Zeckendorf sum. Thus the number of Zeckendorf decompositions in the interval [F n ,F n+ i) is 
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equal to F n _i, which is the total number of integers in that interval. As n was arbitrary, and 
these intervals partition the set of natural numbers, we have shown that every natural number 
has a unique Zeckendorf decomposition, and the number of N in [F n , F n+ i) with exactly k + 1 
summands in its decomposition is ( n l k ) ■ In other words, the probability of a number in this 
interval having precisely k+1 summands is ( n ~l~ k ) /F n -\. □ 

3. Lekkerkerker's Theorem and the Variance 

We sketch how our approach easily yields Lekkerkerker's theorem. We only provide a sketch 
as, of course, Lekkerkerker's theorem follows immediately from our proof of the Gaussian 
behavior. We highlight the key steps as Lekkerkerker's theorem is of interest in its own right, 
and it is nice to have a new, elementary proof of it (our proof of the Gaussian behavior will 
involve Stirling's formula, and is different than the arguments below). 

The average number of summands needed in the Zeckendorf decomposition is just 



(l>r, 



2~J (n-l-k\ 

£ (* + !)■ k ] 



k=0 



n-1 



I — - I 

1 , i n — 1 



1 + ^X> 



F n _i f-< V k 



n ~ l k=0 



= l+P- (3-D 
Thus the problem is reduced to computing 

£ w = E k ("~l~ k )- < 3 - 2 > 

We can determine a closed- form expression for S(n) by first showing that it satisfies a certain 
recurrence relation. 

Lemma 3.1 (Recurrence relation for S(n)). We have 

£(n) + £(n-2) = (n-2)F n _ ?J . (3.3) 

The proof follows from straightforward algebra, and is given in Appendix |A] Solving the 
recurrence relation yields 

Lemma 3.2 (Formula for £{n)). We have 

n F 1 

£(n) = ^4 + 0(^-2). (3.4) 
ip z + 1 

The proof follows from using telescoping sums to get an expression for £(n), which is then 
evaluated by inputting Binet's formulau and differentiating identities. Recall Binet's formula 

4 There are many ways of proving Binet's formula. One of the simplest is through generating functions and 
partial fractions; a generalization of this plays a key role of the proof of the general case of Theorem 11.31 (see 
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(with our notation) asserts 

F n = -j=-<p n ^-(1 -v) n , if = — - — • 3.5 

V5 V n 2 

The details are provided in Appendix lAl 

Lekkerkerker's Theorem now follows immediately by substituting the result for S(n) from 
Lemma [3.21 into the equation for the mean, ( 13. ip . Explicitly, 

Theorem 3.3. The average number of non- consecutive Fibonacci summands used in repre- 
senting numbers in [F n , F n+1 ) is 

E[K n ] = fi n = n - \ = n ~ \ = E[JC n ) + 1. (3.6) 

1(J b ip z + 1 5 

A similar calculation shows 

Theorem 3.4. The variance in the number of non- consecutive Fibonacci used in representing 
numbers in [F n ,F n+ i) is 

Var(^ n ) = al = ^= n - | = n - | = V ar(/C„). (3.7) 



4. Gaussian Behavior 

In the last section we computed the mean by evaluating certain combinatorial sums. We 
could similarly derive an explicit formula for the variance, or more generally, any moment. 
We choose instead to analyze the density p n (k) in greater detail, and show that it converges 
pointwise to a Gaussian with mean \x n approximately = an d variance cx^ ~ 5(^+2) • ^ n 
the course of proving this convergence, the mean and the standard deviation naturally fall out 
of the calculation. While this does make the previous section superfluous, we chose to include 
it as it provides an elementary proof of Lekkerkerker (as well as telling us what the mean and 
variance are, which are a great aid in performing the Stirling analysis below@). 

Before delving into the proof of Theorem ll.3[ we provide some evidence by looking at the 
number of summands in the Zeckendorf decomposition for integers in [i^oitb -^2011) (see Figure 
HJ; the fit is visually striking. 



4.1. Preliminaries. The computation is a little cleaner if instead of studying the interval 
[F n ,F n+ i) we instead study [F n+1 , F n+2 ). The density is now 

In— k\ 

Pn + l(k) = -4^. (4.1) 



If we didn't know /x„ and a n we would just keep these as initially free parameters, and then choose the 
values appropriately to ensure the limits below exist. 
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Figure 1. Plot of the probability P2oio(^) of having exactly k + 1 summands 
in the Zeckendorf decompositions of integers in [.^2010,-^2011) (with p n (k) = 
( n ~l~ k ) I F n -i) against the Gaussian with mean /i2oio ~ 555.55 and variance 
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We list some useful expansions: 

AWi 



n 



a 



¥ + 2 

(pn 



n+l 



5(if + 2 



+ 0(1) 
0(1) 



V5 

We will expand ( n ~ k k ) using Stirling's formula, which says for large m that 

1 1 139 



(4.2) 



m m e 



m m e 



l V2 



Tim 



l V2 



urn 



12m + 288m 2 blMQnr 



1 



+ 



l + OI- 

m 



(4.3) 



As the mean and variancej are both of size n, for any fixed e > as n — > 00 there is negligible 
probability of all k with \k — yU„+i| > ?i, 1//2+<: Thus we need only worry about analyzing 
Pn+i(k) for \k — /i n +i| < n l l 2+e . For such k, we have k, n — k and n — 2k will all be of size n 
and hence large, and thus Stirling's formula yields a great approximation. 



While we have not proved the variance is of size n, a similar calculation as that for the mean yields it 
without any trouble; in the interest of space we merely state the result in Theorem 13.31 We could also get the 
right order of magnitude for the variance by the Stirling computation that follows. 

7 This follows by Chebyshev's inequality. As we are on the order of n e standard deviations from the mean, 
the probability is at most 0(l/n 2e ), which tends to zero with n. 
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After some simple algebra, which includes using Binet's formula (see (13. 5p ) for F n , we find 
f n — k\ 1 



Pn+l(k) 



k J F, 



n 



(n-k)\ 1 



k\{n-2k)\ F n 

1 (n - k)- k ^ {1 + 0(^- k )) v/5 



2n (1 + (i)) (n - 2fc)-*4 (l + O(^)) ^ . ^ (i + ( i 

1 + 



27rA; fc+ 5(n-2A;) n - 2fc+ 2 </? • <P 
1 



n 



/ n+1 (fc) 1 + 



n 



It suffices to analyze f n+ i(k) for |/c — < n 1//2+e , as the 0(l/n) term leads to negligible 
changes in the density function. We now split off the terms that exponentially depend on n, 
and write 



fn+i(k) 




[n 



-k) Vb\ / n (n-k) 



n—k 



y k{n - 2k) (p J y k k (n - 2k)" 
= N n (k)S n (k). 

We change variables and replace k with its distance from the mean in units of the standard 
deviation, a n+ \. Thus we write 

k = fi n+1 + xa n+ i, (4.4) 

and 

f n+1 (k)dk = f n+1 (p n+1 + a n+1 x)a n+1 dx. (4.5) 

It is essential that we record dk transforms to a n+1 dx. In these new variables the main action 
occurs at x = 0, and the scale is on the order of 1; in other words, once x is large (such as n e ) 
then we are many standard deviations away and the density is negligible. 

The next few pages are the detailed computation. While the computations are long in 
places, the basic idea (the combinatorial perspective) is straightforward: viewing the problem 
combinatorially yields an explicit density function, whose large n asymptotics follow from 
Stirling's formula. 

4.2. Analysis of N n (k). 



Lemma 4.1. For any e > we have 



12 MURAT KOLOGLU, GENE KOPP, STEVEN J. MILLER, AND YINGHUI WANG 

Let C = so fx n+1 = Cn + 0(1), and recall a 2 +1 = 5 ^ 2 ) + We use a cnan g e of 

variable and some algebra to simplify N n (k). We set 

u = °-^x « 4= « ©(n- 1 / 2 ^), (4.7) 

and find 

7V n (£;) = 



1 / n — k a/5 



27r V - 2k) ip 



1 / I -kin y/E 



y/2^n~\ (fc/n)(l -2fc/ra) <p 



1 1 - + (T„ + ix)/n y/E 



\/2ro V ((AWi + <7n+ix)/n)(l - 2(/i n+ i + cr n+ ia;)/n) ip 



l-C-U y/E f 1+ l 



v^\/ (C + u)(l-2C -2u) ^ v " 

where the last error arises from replacing fi n +i with Cn + 0(1). In fact, as u = 0(n _1 / 2+e ) 
we may drop the it's at the cost of replacing the error term 0(n~ r ) with O^" 1 / 2 " 1-6 ). The 
following relations help simplify our expression for N n (k): 

V + l = V \ C=^—, l-C = ^±±, 1-2C = * 



ip + 2 (p + 2 ip + 2 

Using these, as well as a 2 +1 = 5 ^ 2 ) + 0(1) and ip + 1 = p 2 , we find 



1/ (^ + 1)^ + 2) ^5 i + Q 



^J^.(i + o(^)) 



l 



(l + O^- 1 ^)). 



'n+1 

4.3. Analysis of S n (k). 

Lemma 4.2. For any e > we have S n (k) = exp(— x 2 /2) exp (n _1 / 2+3e ). 
Proof. To understand 

*.(*> = {n - k)n ' k 



k k (n - 2k) n ~ 2k 
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we take logarithms and once again change variables by k = /i n+ i + xa n+ ±. We find 
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log S n (k) = leg I 7 : 1 ( " / '' ) " " 



k k (n - 2k) n ~ 2k 

—n log ip + (n — k) log(n — k) — k log k — (n — 2k) log(n — 2k) 
-nlog if + (n - (ii n+1 + xa n+ i)) log(n - (u n+ i + xa n+1 )) 

~ (Hn+l + XOn+l) l0g(A* n+ i + Xa n+1 ) 

-(n- 2(/i n+1 + xa n+1 )) log(n - 2(/i n+1 + xa n+1 )) 
—n log ip 

xa n+1 



+ (n - (ii n+1 + xa n+1 )) log(n - + log 1 - 

V \ n- fi n+1 

- (// n+ i + xcr n+ i) ( log(/i n+ i) + log ( 1 + X<Jn+1 

V V AWl 

- (n - 2(u n+ i + xa n+1 )) ^log(n - 2// n+ i) + log ^1 - ^ 
— n log 

xcr n+1 



+ (n - (/i n+ i + xo- n+ i)) (^log ( 1 J + log ( 1 - 

- ("n+i + xa n+1 ) log ( 1 + 



n „\ , /_ 2x a n+l \ 



- (n - 2(/x n+1 + xa n+1 )) ( log ( — - - 2 J + log ^1 - —ff- ) . (1.8) 



We may simplify the expression above by replacing any u n+ i inside a logarithm with Cn. 
This is because the resulting Taylor expansion of the logarithms will yield a term of size 1/n 2 . 
The largest this can be multiplied by is n, which leads to an error at most 0(l/n). We 
exponentiate to get S n (k), which leads to an error factor of size 1 — exp(0(l/n)), which as 
n — > oo is just of size 1/n. We have 

_!!__!) = log !i^±i = .ogi-^ + a) = 2io g « + a 



as (1 - C)/C = <p + 1 = (j) 2 . Similarly 



n \ , n - 2/x n+1 , 1-2C ^ / 1 \ , J ^ / 1 
2 = log ™±i = log— — + - = Iog0 + O - 
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Substituting these into (14. 8 j) yields 



logS n (k) + 0[- 

-n log (p + (n - (^ n+ i + x(x n+ i)) ^2 log tp + log ^1 - ^"+,^ 



- (/i n+ i + xa n+ i) log (l + -^^") 



(n - 2(// n+1 + x<r n+1 )) ( log^ + log ( 1 - ^^c )n ) ) ■ ( 49 ) 



We note that the coefficient of the log tp term is zero, so 



log S n (k) + [-) = (n - (iJ, n+ i + xa n+ x)) log ( 1 " H 



n J \ (1 — Gjn 



- (n - 2(/i„ + i + xa n+1 )) log 1 

((1 - C)n - xa n+1 )) log (l - 

- {Cn + xa n+1 ) log (l 



(l-2C)n 
xa n+1 



(l-C)n 
xcr n+1 

Cn 



- ((1 - 2C)n - 2xa n+1 )) log (l - { ^c) n ) ' 

Let u = xa n+ i/n. Note w = 0(n _1 ' 2+€ ), and thus when we expand the logarithms above, 
we never need to keep more than the u 2 terms, as anything further will be small, even upon 
multiplication by n. Expanding gives 

logS^+O^-W 6 ) 

u u 2 \ ( u 



(1 - C)n - — — - un 



1-C 2(1 -Of) \ 1-C 



u u 2 \ (U 



- Cn {c-2&)- m [c 

-d - 2C)n (-ji^ - + 2un (-^_ , . (4 . n) 
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Note all the un terms cancel, and all that survives are the u 2 n terms. Thus 



log + 0(n- 1 / 2+3e ) 



1 111 

+ 



2(1 -C) l-C 2C C 1 — 2(7 1-2C 



u 2 n 



u 2 n 



(4.12) 



2l7-6l7 2 + 4C 3 ' 

As C = -^p2 anc ^ u = xcr n+i/n with o~ 2 +1 = 5 ^" 2 ) + simplifying the above yields 

log^W + O^- 1 ^ = _^ (413) 

and thus exponentiating gives 

S n {k) = exp{-x 2 /2)exp(n- 1/2+3€ ) . (4.14) 

□ 

4.4. Proof of Theorem 11.31 Using the results from the previous subsections (Lemmas 14.11 
and 14. 2\ and the change of variable argument on why a factor of <J n+ i enters), we can now 
prove the convergence to a Gaussian. 

Proof of Theorem \1.3l The only item left to prove in Theorem 11.31 is the convergence to the 
Gaussian. We have 

p n+1 (k)dk = p n+ i (//„ +1 + cr n+1 x) a n+1 dx 

= N n (k)S n (k)a (1 + 0{rT l ' 2+u )) dx 

' ■e-^ 2 -a n+1 (l + 0(n- 1 / 2+3e ))rfx 



V 27ra 'n+i 

1 e- x2 l 2 {l + 0{n- l ' 2+ ^))dx, (4.15) 



V2vr 

which clearly converges to the standard normal as n — > oo. □ 



5. Far-difference representations 

We now consider the problem of the far-difference representations. We are studying the 
number of positive and negative Fibonacci summands in the decomposition of integers in 
[S n , S n+ %), where S n = Ylio< n -n<n Fn-u- Using combinatorial arguments as in Lemma I2TT1 
(which involve solving a variant of this Diophantine problem inside a variant of this Diophantine 
problem), we can come up with a formula for the joint density function p n (k, £) for the number 
of integers in [S n , S n+ %) with exactly k positive Fibonacci summands and exactly £ negative 
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Fibonacci summands. After a lot of algebra, we obtain the formula: 



Pn(k,£) 



E 

r=0 



+ 



fe-1 

k — r 

£-1 
r — 1 

i-\ 

r 



£-l\/n-3(k + £) + 2r-3 
r-2) { k + £ 



n - 3{k + £) + 2r - 2 
k + £ 

n-3(k + £) + 2r 
k + £ 



£-1 
r — 1 



n-3(k + £) + 2r- 1 

k + e 



(5.1) 



If we could get good asymptotics for this formula, we could calculate the limiting density 
directly, but this does not seem feasible. With enough patience, this formula can be used to 
calculate any particular joint moment, but this becomes cumbersome very general. Unlike 
our expression for the standard Fibonacci case, here we have sums and products of binomial 
coefficients that would need to be simplified before we can fruitfully apply Stirling's formula. 
As the generating function technique is able to handle this problem, we invite the reader to 
see |MWj for a complete analysis of this problem. 

6. Conclusion and Future Research 

Our combinatorial viewpoint has allowed us to extend previous work and obtain Gaussian 
behavior for the number of summands for a large class of recurrence relations. This is just 
the first of many questions one can ask. Others, which we hope to return to at a later date, 
include: 

(1) Lekkerkerker's theorem, and the Gaussian extension, are for the behavior in intervals 



m -^n+1 1 



Do the limits exist if we considere other intervals, say [F n + gx(F n ),F n + 



92{F n )) for some functions gx and g-p. If yes, what must be true about the growth rates 
of g x and g-P- 

(2) For the generalized recurrence relations, what happens if instead of looking at YH=x a * 
we study Yli=i m in(l, in other words, we only care about how many distinct if;'s 
occur in the decomposition. 

(3) What can we say about the distribution of the largest gap between summands in the 
Zeckendorf decomposition? Appropriately normalized, how does the distribution of 
gaps between the summands behave? 

Appendix A. Proofs of Combinatorial Identities 
We collect the proofs of the various needed combinatorial identities. 



Lemma A.l. Let F m denote the m Fibonacci number, with Fx — 1, F 2 
and so on. Then 



E 

fc=0 



n 



1 - k 

k 



n-l- 



2, F 3 = 3, F A = 5 
(A.l) 
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Proof. We proceed by induction. The base case is trivially verified, and by brute force checking 
we may assume n > 4. We assume our claim holds for n and must show that it holds for n + 1. 
Note that we may extend the sum to n — 1, as (™ l k ) = whenever k > l 1 ^]- Using the 
standard identity that 



m 



+ 



m + 1 
+ 1 



and the convention that P?) = if I is a negative integer, we find 



E 

fc=0 



n — k 
k 



E 

fc=0 



n — 1 — k 
k-1 



n — 1 — k 
k 



e +e 

fc=l v 7 fc=0 

'n - 2 - (jfc - 1 
k-1 



n — 1 — k 
k 



E 



+ E 

k=0 



n — 1 — k 
k 



k=l 

= Fn-2 + F n -\ 

by the inductive assumption; noting F n _ 2 + F„_i = F n completes the proof. 
Proof of Lemma \3.1[ The lemma follows from straightforward algebra. We have 



S(n) 



E* 

fc=0 

E* 



n — 1 — k 



(n - 1 - fc)! 



^ fc!(n-l-2Jfe)! 



X>-1-*) 

fe=i 



(n-2-Jfe)! 



£(n-2-(*-l)) 



(jfe-l)!(n-l-2Jfe)! 

(n-3 - (Jfe-l) 



fc=i 



(jfe-l)!(n-3-2(ife-l))! 



2>E( 



n — 3 



n — 3 — 



E 



n — 3 — 



(A.2) 



(A.3) 
□ 



= (n-2)F n _ 3 -£(n-2), (A.4) 

which establishes the lemma. Note that we used the binomial identity again (Lemma IA.1)) to 
replace the sum of binomial coefficients with a Fibonacci number. □ 



18 MURAT KOLOGLU, GENE KOPP, STEVEN J. MILLER, AND YINGHUI WANG 

Proof of Lemma \3.2i Consider 



J2(-l) e (£(n-2£)+S(n-2(£+l))) = ^ (-l) e (n - 2 - 2£)F„_ 3 -2£ 
e=o e=o 

= C" 1 )^" " 3 " 2£ ) F n-3-2£ + £ (-l) £ (2£)F n _ 3 _ 2e 

1=0 1=0 

= J2(-l) e (n-3-2£)F n ^ 2e + 0(F n _ 2 ); (A.5) 

1=0 

while we could evaluate the last sum exactly, trivially estimating it suffices to obtain the main 
term (as we have a sum of every other Fibonacci number, the sum is at most the next Fibonacci 
number after the largest one in our sum). 

We now use Binet's formula (see (13. 5p ) to convert the sum into a geometric series. Letting 
<p = 1+ 2 be the golden mean, we have 



V5 



■ ¥ 



V5 



(A.6) 



(our constants are because our counting has Fx — 1, F 2 — 2 and so on). As |1 — (p\ < 1, the 
error from dropping the (1 — term is 0(J2e< n n ) = 0{n 2 ) = o(F n _ 2 ), and may thus safely 
be absorbed in our error term. We thus find 



£(n) 



^= (n - 3 - 2£)(-l)V"- 3 - 2 ' + 0(F„_ 2 ) 



1=0 



n-2 



[n 



-3)^(-^ 2 r-2^£(-^ 



£=0 



£=0 



0(F n _ 2 ). (A.7) 



We use the geometric series formula to evaluate the first term. We drop the upper boundary 
term of (— c/^ 1 ) L a J ) as this term is negligible since cp > 1. We may also move the 3 from the 
n — 3 into the error term, and are left with 



S(n) 



n-2 



V5 

n-2 



n 



1 + f 



-2 



-<p 



1=0 



V5 



n 



2S 



n — 3 




L 2 J 





+ 0(F n _ 2 ) 

+ 0(F„_ 2 ) 



where 



S(m,x) = jxK 

j=0 



(A.8) 



(A.9) 
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There is a simple formula for S(m,x). As 



m r m+l 



T m+i _ i 

£* j = - — r-> (a. 10) 



applying the operator x-^ gives 



, (m + l)x m (x - 1) - (x m+1 - 1) mx m + 2 - (m + l)x m+1 + z 
S(m,x) = g,V = x ^-^ = ^—^ . 

(A.11) 

Taking x = —tp 2 , we see that the contribution from this piece may safely be absorbed into 
the error term 0(F n _ 2 ), leaving us with 

£{n) = Vlf^ + °< F -> = + 0(F - 2) ' (A ' 12) 



Noting that for large n we have F n _i = ^ + 0(1), we finally obtain 

nF w _i 

A more careful analysis is possible; such a computation leads to the exact form for the mean 
given in (I3.6p . □ 



£(n) = ^4 + 0(F n _ 2 ). (A.13) 
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